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ABSTRACT 

In special education, a merger between assessment and 
instruction is mandated by Federal law (PL 94-142), wherein teachers, 
of handicapped pupils are required to specify individualized 
educational programs that include procedures for assessing students' 
progress toward goal attainment. The purpose of tKis study was to 
deterfnine the effects of technically adequate, repeated 
curriculum- based measurement and evaluation procedures on students' 
reading achievement and on their knowledge concerning their own 
learning. Thirty-fline teachers, each having three or four students in 
the study, were assigned randomly, to measurement/evaluation gtoujTs. 
Experimental group teachers employed continuous evaluation procedures 
'while contrast group teachers employed conventional spedicil education 
measurement and evaluation procedures. Students, were pre- and 
posttested on an oral passage readiog^ test and tbey were posttested 
on the Stanford Diagnostic Reading Test,^ Structural Analysis' and ^ 
Reading Comprehension subtests. Additionally, students were 
interviewed at the end of the study to assess their knowledge about 
their own progress. Analyses revealed that experimental students 
achieved, more than contrast group students and were more 
knowledgeable about their learning. Results suggest that technically 
adequate, repeated curriculum-based measurement, when used by 
teachers to evaluate and modify programs, positively affects student 
achievement. Rating scales, an interviey; form and training schedules, 
are appended. (AuthOr/PN) . 
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Abstract • ' 

The pufposfe of this study was to determine the ef^cts of 
technically adequate, repeated curriculum-based measurement and 
evaluation procedures on students' reading achievement and on their 
knowledge concerniag their own learning. Thirty-nine teachers., each 
having three to four students in the study, were assigned randomly to 
measurement/evaluation groups.. Experimental group teachers employed 
continuous evaluation procedures while contrast group teachers 
employed conventional special education measurement and es^aluation 
procedures. Students were pre- and posttested on an ora\ passage 
reading test and they were posttested on the Sta^nford Diagnostic 
Reading Test, S.tructural Analysis arid Reading Comprehension subtests.^ 
Additionally, students were inter?viewed at the end of the study to 
assess their knowledge about their own progress. Analyses revealed 
thatvexperimental students achieved 'more than contrast group students 
and were more knowledgeable about their learning. j Implications for 
special education programming and assessment are (Jiscussed. , ^ 



Ejects of Frequent Curriculum-based Measurerfient and 
Evaluation on Student Achievement and Knowledge of Performance: 

An Experimenta^l' Study 

the learning principles of educational psychology (Crow & Crow, 

* 

1963; Farnham-Diggory, 1972; Gagne, 1964) provide a theoretical 
framework for integrating measurement and evaluation with instruction. 
Properly conducted assessment provides at least tWo types of data that 
may be useful in enhancing achievement*. Firsts assessment can help 
teachers^ better structure curriculum and teaching strategies to 
address' s,tudents'' individual needs and rates of progress. Second, 
measurement and evaluation can constitute an important source of 
feedback to- a learner and can help the learner to see graded series of 
accomplVshmerits and to recognize strategies for futur-e goaV 
attainment • , 

In special ^^ucation, a- merger between assessment and instruction 
M mandated by Federal ,.law/(PL 94-14;),* wherein teachers of 
handicapped pupils 3X0. required to specify Individualized Educational 
Programs (^EPs)' that include procedures for assessing students' 
progress toward goal attainment. Substantive compliance with this law 
dictates that both mea'suring and evaluating student progress be on- 
going so that the adequacy^of individual programs can be monitored 
contiguously an^ improved as required (Deno & Mirkin, 1979). A small 
body of research (.Beck, 1981;/.Haring, Maddux, & Krug, 1972; Mirkin, 
Deno, Tindal,*& Kuehnle, 'l980) has begun to accumulate titat supports 
the hypothesis thai such on-gping measurement and evaluation may, in 
fact, facilitate student TicfvPevement. In the fol lowing^ discussion, 
the effects of fre'diuent testing and the, effects of on-going evaluation 



are reviewed separately. Thenrunif led systems for integrating testing 

and evaluation with instructi-on are discussed. Finally, the purp6se 

of the current stiidy is presented. 

■ Effects of Frequent Testing , • 

^ .Frequent testing appears to affect student achievement. One of 

the earliest 'published studies on the relation between achievement and 

testing was conducted by Gates (1917), who found that retention of rte'w 

information was greatest when the presentation of each information 

unit was followed by test. Gates' research set the stage for a 

series ' of' studies' investigating this notion. These studies (Jones, 

;.1923; Keys, 1934; Noll, 1939) tended to support Gates' original 

findings. Ear.ly research on the positive relation , between student 

learning and frequent testing has been corroborated by more- recent 

» 

'studies. Robin (1976) found that frequent testing, accounted far jnope. 
variance in student achievement than any other variable in Keller's 
(1968) Personalized System of Instruction. Similarly, Omelich and 
Covington (1981) demonstrated with a path analysis that frequent 
testing was the factor that contributed most to pypil learning in* the 
behavioral paradigm of direct, ftj'equent assessment. 

Frequent testing appears to enhance not only achievement but, also 

"studei^t motivation, -s^ldhusen (1964) studied pupils who were 
administered 14 weekly' quizzes^ that counted cjf one-fourth of their 

'class grades.' At least 8055 of these students .reported that frequent 

, quizzes caused more learning, .motivated them to study more thain usilal, 
and were effective in checking self progress.,'" Of those students who 
reported that .quizzes caused anxiety, 87% approved of th5t anxJety 



because they' believed that "it helf)ed them to learn. 

Frequent testing also improves s*tudent motivation by .providing 
feedback that can generate confidence in s^uflents regarding their 
ability 'to learn (Peckham & Roe, 1977) • X it cin enhance an 
Individual's' perception of self-efficacy that can, j'rt t\rn, influence 
thought patterns, actions, and emotional arousal that l^ead to goal 
attainment (Bandura, 1982; Prentice, 1961). Therefore, research 
supports the notion that frequent testing is related to students* 
growth, their attitude and approach to learning. 
Effects of Continuous Evaluation - ' ' 

Unfortunately, it appears that frequent testing may be an 
insufficient condition ^for enhanced student development. ; Evidence . 
suggests- -that teachers who collect stbdent performance data^do not 
necessarily use those data to make instructional decisions or to 
provide ^feedback to students , (Baldwin, 1976; White, 1974). 
Additionally, when teachers fail .to evaliiate^ student 'test data 
continuousTy and systematically, iBarning is not improved '(Tindal, 
Fuchs, Christenson, Mirkin, & Deno, 1981). 

Attempts to Integrate Frequent Testing and Cogtinuous Evaluation into 
Instruction • 

To inprease the probability that students and teachers will use 
frequent testing results to influence student groi^th, educational 
psychologists have developed comprehensive systems for repeated 
testing and systematic -evalaation. Keller (1^68), for exantple, 
djEfVeloped * the Personalized System of Instruction' (^Sl), where 
individuals (a) pr^ogress at thei'r .own paces, (b) are teste?!, repeatedly 



until unit perfection is attaiined before being permitted to advance to 

new material, and (c) in§efj^ct primarily with proctors whd administer 

te'sts, provide instructiony^nd deliver immediate feedback. 

Bloom (1971\ has developed the model of mastery learning, which 

is ta^ed on Carroll's (1963) assumptions that learning can be broken 

down into a series of small tasks and that all students will master a 

given task provided they are given adequate time and instruction. The 

procedures for this model designate that consistent feedback be 

provided to students and teachers, and that instructional correctives 

♦ '» 

4 

also be provided as^required. As with Keller's PSI, frequen;t testing 
and f^back are int^egf aV parts of this learning model* 

QtTOr , educational psychologists have built upon previous work by 

If 

incorporating into. Keller and Bloom's basic principles the operant>^ 
methiodology 'of repeated behaviw sam'pling and time-series analysis* 
Uovitt (1977), L^ndsley (1971), , and WhiFe- and Harin^ (1980) have, 
developed motfels ^.direct, frequent testing and evaluation where a 
teacher collects repeated short samples of a student's behavior within 
a curriculum,*^ over a time period, and ^ under diff^ent teaching 

strategics. The' teacher then applies the methods of time^series* 

* '* ' . 

analysis to the data in order to determine the effectiveness of * 

specific program dimensions. , 

Unfortunately, all^of the**CLirricu,lum^-H)ase'3 testing and ev:aluati'on 

models ^referenced abgv-e ,r^^uire teachers continuously to create their 

own^ testing m£(terials; / little ] i^$ -known regarding *the„ technical 

characteristics of such self-prepared measurement and testing 

procedures. As Popham (1980) illustrates, creating a valid, reliable 



criterion-referenced. test is laborious, difficult, and time consuming. 
Additionally, since variations in testing, procedures have a dramatic 
bearing on the adeque^cy of ^nor*m-referenced standardized tests 
(Sattler, 1974), variations .in procedures for repeatedly testing 
student performance on currrculum objecti^ves also may affect the 
technical adequacy of the measurement. 

Therefore, while the development of curriculum-based testing and 
evaluation procedures may address the need to integrate me*asurement 
and evaluation with instruction, some models may have neglected the 
need for edumetrical ly adequate testing and evaluation instruments and 
procedures. Such approaches to^easurement may create unreliable and 

invalid data bases, with which to, formulate educational .decisions. 

» 

In response to the need for technically adequate measurement and 
evaluation, Mirkin, -Deno, Fuchs, Wesson, Tindal, Marston, and Kuehnle> 
('1981) developed a model of curriculum-based measurement and time- 
series evaluation that^ includes five decision-making steps* At. each 
step, the procedures (described below)' demonstrate edumetric adequacy 
and. logistical feasibility (Fuchs, Mirkin, Deno, Marston, t Tindal, 
1982). . 

Step 1; Selection of the program objective. The 
program objective specifies the date and the segment of the 
curriculum on which the student will perfdrni, and the 
specific level of ,,,prof i.ciency. So, a student's reading ■ 
'• objective might specify: By, June 2, when presented with 
s'tories from page 29 to- page 307 in Level U for one minute, 
George wilT read aloud at least 70 words correctly with 7 or 
fewer errors. In spelling, a student's objective might 
read: By February 15, when dictated words from Level, 9 for 
two minuses, Henry will write at least 10 words Correctly 
with no morethan 2 errors. 

Step 2: Selecttfon of the measurement domain. The- 
measurement domain is the set of material fr6m which test 



10 



samples arefdrawn'. Once an object'ivg has been established, 
tliS fDeasurement domain is^ defined as that level in the 
purrictJlum where the student, is expected to be proficient. 
Consequently^ given the above reading objective, the 
measurement domain would be reading passages from pages 29 
to 307 in Level 11, ■ Given the above spelling Objective, the 
measurement domain would be the words from Level 9, 

Step '3: Selection of the measurement task and 
procedure > The measurement task 'is a randomly selected 
portion of the measurement domarn, ' For example, using the 
objectives cited above, .ai^'passaga randomly drawn from pages 
29 to 307 would be the measurement task tn reading; a set of 
words randomly drawn 'from Level 9 would.be the measurement 
task in spelling. Each time measurement occurs the stimulus 
material changes; a new sample from *the domain is drawn 
r^indomly; The measurement procedure varies across academic 
areas but, within a -g^iven area, / ^the ^ procedure . is 
standardized across days'. ' (See mrk in et.iil., 1981, for 
recommended • procedures in reading,' spelling, written 
expression, and social bel^avior,) . Student performance is 
measured at least three times per week; measurement occurs 
.daily if possible. 

Step 4; Selection of a time-series 'data display . 
Student performance data are displayed on a'Cartesian graph, 
with "school days" labeled .on that horizontal - axis and 
"behavior" on the vertical axis. A data -point represents 
the student's performance leveV'on the measurement domain 
for a g-iven day. An exWle of a graph is provided in 
Figure !• In this inst;ance, the graph repi^esents the 
"Number of Words Read Aloud from Text Passage$ in Level 2 
Per Day." ^ Increases in the data points across ' the 
measurement days' indicate reading improvement and progress 
toward the student's objective. Each solid vertical lipe of 
the graph repre*sents the introduction of a prograKcH'finge. 



, In^sertf igure l ^Lbput here 

' • r---------- - ■ 

Step 5; Selection of ^ajba evaluation procedures . Two 
alternatives to data analysis exist, experimental and 
therapeutic analysis*. In experimental analysis, program 
changes are introduced ^.regularly and are treated as 
experimental hypotheses concerning their effect on a 
student's performance. The methods of time-series analysis 
tSidman, I960) are employed to summarize' and interpret 
student performance data. , Within program phases, data -are) 
summarized with a split-median solution trend line .(White, 



1971)^ an index of.' variability such -as total bounce.' 
(Pennypacker, Koenig, & Lindsley, 1972), and an index of 
step-up or step-down. Data interpretation is relative; 
effectiveness of a program phase is evaluated by comparing 
descriptive statistics across treatments. Therefore, 
changes in median, level, trend, and/or variability between 
adjacent phases are inspected and interpreted to formulate 
decisions about the effect iA^ervess of' a specific program 
change. * - . 

In therapeutic analysis, the student's, objective is 

• , marked an. the graph with an X at 'the intersection of the 
desired performance level and the expected* attainment date. 
Then, a line of required progress, connecting the student's 
baseline median score with the desired level* is drawn on the 
graph. ' This line is called the dynamic aiml ine. • Data 
interpretation involves the application" of the fol lowing 
rule: If 3 consecutive data points fal!- below the trend 
line, -then the program is judged ineffective and a change in 
the program is introduced. ^ , * 

Evidence (Fuchs, Mirkin, Deno, Marston, & Tindal, 1982) 
suggests that a combination of the two data-util ization 
methods, which borrows more heavily from the therapeutic 
method, may be most useful and. feasible. In this approach, 
recommeaded by Mirkin et al. (1981), teachers draw a dynamic 
aimline on the graph. Then, a* split-median trend Jine on 7 

• to 10 student {performance data point? is graphed and ^ 
compared to the slope of the dynamic aimline. If the student 
performance slope is less steep than the aimline, a program 
change is introduced. 

With this model , technically adequate measurement^ and 
evaluation are integrated with instruction. Program 
development is cumulative^ data- based, and on-going. Time- 
serie^ displays of student performance dati^ are* inspected; 
evaluations 'concerning the effectiveness^ of .individual 
programs are made; successful program changes are'rnaintained 
and incorporated into new program phases, while unsuccessful ' ^ 
Changes are terminated. In thfs way, technically^ adequate, 
curniculum-basetf tiipe-series measurement and evaluation may ^ 
provide an adequate 'data base with which educjitors can / 
' evaluate and improve individua*l programs formatively, to 
increase the likelihood that student objectives will be 
- realized; . / v 



Purpose of th.e Study 

The purpose of the study described- here was to determine the 
educational e'ff.ects of the. technically adequate, repeated curriculum- 



based measurement and evaluation procedures developed by Mirkin^et aL 

(1981) . ' -Specifically, fn special education programs of a large 

metropolitan school district, the effects of such measur'ernent and 

evaluation on students*., reading* achievement and their knowledge 'about 

their own Teaming were explored. * \ 

• Method 

_ • ' 

Subjects'" * J 



Subjects were 39 New York Cify publiia school teachers who 
volunteered ^to particiiSate in the project. From their caseloads, 
teaichers each selected 'three ,pr four students for inclusion in fhe 
study; .then, within schools, each teacher was assigned randomly to 
either an experimental or a contrast group. Four teachers In each 
group had panticipated in a pilot program during the previous academic 
year. \ - ; 

' In^he experimental grG^lp, teachers (3 male, 15 female) had 
taught spec i a 1_ .eduction for an average* 3.79 ;years (SD = 2.85). 
Student's in the experimental ■ group ~(5i fiiciTig; 13^-f€maJe.)- .read ' an 
average 3.48' years below grade level (SD = 1.87), Their age 
9ippropriate grade averaged 5.7^ (SD = 1.66); 49% were 'placed in 
programs for emotionally handicapped students, 32% in. programs for 
brain-injured students, and 19% in resource programsi * 
Contrast group teachers {2 male, 19' female) had taught 
handicapped children for an average 3.59 years (SD =2.72). Students 
in the contrast group (57 male, 20 female) read an average 2.35 years 
below grade l^vel (SD = 1.94)'. Their a|e' appropriate grade level 
averaged 5.45 (SD =1.65); 51% were placed in programs for emotionally 

. ■■ ■ ■ ■ ' 13 ■ : 



handicapped students, 3Q%'^n resource programs, and 53^ equally 
distributed across programs for brain-injured,- physically-handicapped, 
and edlicable mentally retarded children. . • 
Measures * * 

' Accuracy .of iifiplementaJbion rating - scale (AIRS) . The AIRS 
(Wesson, Deno,^ Maruj'ania, Sevcik, ^ Skiba, King, & Mirkin, 1982; see 
Appendix A) was employed to measure the accuracy with which the 
experimental procedures (Mirkin ■ et • al . , 1981) were implemented. The 
first two items of the 13-item rating instrument. Administering the 
Measurement Task and Selecting the Stimulus Material, are completed 
following direct observation of student-teacher interaction. The next 
five items. Sampling for Instructional Level, Baseline, Graphing 
Format, Aimline, and Timing of Instructional Changes^ are completed 
after, inspecting the Istudent graph (see Appendix- B),. The Long-range 
Goal, Short-term Objective, and Measurement System items are scored 
after inspecting th€ Goal and Object ivr'Ti3rm--(-s^_Appe,ndt5C--{lL^ .The 
Tnstrqctional Plan? Substantial Changes, and C>ear Changes scores are 
based on inspection of the Instructional Change Form (see Appendix D). 
Foi? descriptive purposes, eight items "were include,d in one, groUfJ 
called Measurement since they all 'related to -establishing and 

4 

conducting measlirement (of. Wesson *et al.^ 1982). 

Student interview .^ "A fbur-it«m int^view schedule was designed 
to assess .students' knowledge of (a) their reading progress, (b) their 
reading goals, and (c)- the likelihood that they would attain their 
reading goals during the current academic year. The last two items 
requir*ed interviewers to assess the accuracy of student responses 

14 . 



against graphs and records* , ^*See End-of-year Student Questionnaire iit ^ 
Appendix E.) 

Passage reading test > A series of three reading passages from a 
third grade book of the Ginn 720 reading series (GinR & Company, 1976) 
was employed in measurement. 'Two passages were sampled randomly .from 

•the text' and one passage was chosen to represent the readability level 
of the last 25% of the text. (See -Fuchs & Deno, 1981, for the 
selection procedure.) - ' ' ^ 

Stanford Diagnostic Reading Test . Two subtests. Structural 
Analysis and Reading Comprehension, of the Stanford Diagnostic Read.ing 
Test (Karlsen, Madden, & Gardner, 1976)', Green Level, Form- A* were 
employed as measures. The Structural Analysis subtest (SA) measures a 
child's decoding skills through the analysis of word parts. The SA 
asks pupils to (a)- identify the first syllable of two-syllable words, 
and (b) find, meaningful word parts and blend ,those parts into real 
words. Internal consistency reliability for the SA, Form A, was .93 
Tor grade 3 students, and .95 for grades 4 and 5 children. Criterion- 
related validity with respect to the reading tests of the Stanford 
Achievement Test ranged from .62 to .85, with , a correlation 
coefficient of .85 between the SA and the Total Reading Score on the 

j^nfor^ Reading Tests > 

' The Reading Comprehension subtest (RC) assesses -literal and 
inferential comprehension through short reading passage's presented in 

r 

a multiple choice format . and through short passages , followed by 
q'uestions. The passages/represent different subject matter areas at ^ 
third through fifth grade readability level. Internal consistency 



■ • • . , . ^ 11 

reliability was reported at .96, TOr the- RC, Form A, at grades 3 . 
through 5. Criterion validity wi.th respect to the »*eading tests of 
the Stanford Achiev ement 'Test was reported, .with correlation 

.... ' F 

* , ' ^ ■ . 

coefficients ranging from .68 to .90* . * • 
ProcedurB - , ' 

Pretest . - During November^ students were pretested on the third 
grade Ginn 720 reading passages,, with on^ ^passage'^administered each 
week during three successive weeks. Each student; was, ^tested • 
Individually by one of three teacher trajners who were skilled rn the . 
reading passage administration procedure (see Mirkin et aU, 1981). 
In this reading passage test, pupils read orally for one minute; words 
correct and errors per minute were scored. . , 

* Treatments . The experimental treatment is described in- 
Procedures to Develop and Monitor Progress on lEP Goals (Mirk in et 
_al., 1981). Employing 4ese procedures in the arga of reading, the 
experimental group teachers first wrote curriculum-based lEP goals anci 
objectives. The annual goal specified the segment of the curriculum 
and the date on which a .student would read at a certain rate and 
accuracy. The objective \:ontained' supplementary information; it 

'I ^ • 

■indicated the week Ty' rate *at' which the student would improve in order 

to meet the annual goal (see Goal and Objective form in Apjiendix C). 

After setting goals and objectives, teachers developed curriculum- 

based measurement systems to match specific goals and objectives. 
* " * 

. Tfien, th.ey were to measure sttidents .at least twice weekly and utilize 
those data to determine when to intr'qdtice program changes to increase 
the probability that students would' achieve' their goa.l^. By January 



12. ' 

4, all teachers had written goals* atid; objectives, and were measuring 
and graphing student performance. - By February 10, all teachers were 
employing data utilization rules to determine when to make 
instructional changes (cf. Mirkin et al., 1981). 
^ A sample Goal and .Objective Form, Grap.h, and Instructional Change 
Form for Michael appear in Figures 2^ 3, and 4, respectively. The 
Goal and'Objeclive Form states that, in the 19 weeks remaining in the 
school year;. Micha^.l .will .improve his reading in Level 2 of the SRA 
passages so that Ke 'reads 85 words correctly per minute with no more 
than 8 Vrors {see * Figure 2). This annual goal represents 
approximately a 2.5 times improvement over Mich.aeTs baseline rate of 
35 words per minute (see Figure 3). This annual goal then was divided 
into 19 weekly improvement rates; Michael's objective . s^tates that he 
will improve at an average Increase of 2.6 words correct /each week 
with no increase in Errors. 

" " " " " — — — — —— — — — — — — ^ ^ 

Insert Figures 2-4 .about here 

Michael's graph {sjee Figure. 3) represents his words read .per 
minute each school day in Level, 2 of the SRA passages. The first 
three' days of data depict Michael's baseline performance; the diagonal^ 
^line conne^cting the baseline median performance with the X is^ the 
dynamic aimlina, which represents Michael's objective or the d&Uy 
rate at which he must perform in order to rtleet the annual ^go§J^ 

The vertical lines on Michael's graph signify the introduction of 
•program changes, which are indicated by the letters B, C, and D. Each 
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prograrp change is' derailed on the Instructional Change Form (see 

« * 

Figure 4). At least two times each week;» Michael read from a randomly 
selected passage -from Levfel 2 for one minute; the number of words he 
read correctly &nd the number .of errors were scored and graphed on 
Figure 3; Every 7 to 10 data points., Michael's teacher inspected the 
graph. If 'the slope calculated by the split median solution (White, 
1971) of Michael's performance, across the 7 to' 10 day^ vfts less than 
the slope of the dynamic aimline, then the teacher introduced a' 
* program change. As Figure 3 illustrates., MichaeVs performance 
improved dramatically over his previous performance with the 

introduction of the fch1H4l program change. 

*♦ . " ' 

'.With, the . 6oa,l and Objective Form, the • Graph, and .the 
Instructional Change Form, Michael's educational prog»^ and progress 
toward goals were evaluated formatively. In -response to^'me-asurement 
data, Michael's program was modified throughout the treatment to 
•improve. the^ likelihood t)iat Michael would achieve his annual goal. 
.In the contra^ treatment, ; teachers measured and . evaluated 

^;students* j^cogress toward goals using conventional^ special education 
proqedures; these typically include- relatively* infrequent teacheKtnade- 
tests and informal observation (Fuchs, Deno, & Mirk in, 1982; Fuchs^ 
Fuchs, & Warren, 1§82). 1-^ ' * "\ ' 

TralnihqV '" Each of thr^e' .teacher -trainers (TTs), was assigned to a 
set of schools and, to the experimental and contrast teachers within 
that set of scho^s. TTs provided training to tejjqhers during weekly 
meetings, fr9rliovember through May. During the^e me6tin*gs, TTs taught 

, the^^treatment procedures to the experJni^ntal group ^eachers. They 



taught the contrast teachers strategies for diagnosing and treating 
learning and behavior problems, for .structuring and managing their 
instructional gr'oups, . and for using audio-visual equipment and 
paraprofessionals. ^The TTs spent similar amounts of time with* bpth 
groups of* teachers: Experimental group teachers received an average 
23.52 hours of . ind,ividual meeting time (SD = 5. 9SJ), .whereas contrast 
group teachers met individually wit4i the trainers for an average 20. BO 
hours (^SD = 6.22). A t_ test oi^ the , difference between the hours of 
training received by ^ the two groups revealed no statistically 
significant difference. (See Appendix F for sample expe'rimental and 
contrast training schedules^ 

^Data 'collection . TTs were taught ' to measure expferimentjil 
teachers' performance on the AIRS dur^in^ one five-hour training^ 
sesstfen. In January, AfJril, and" late May, a TT observed a randomly 
selected student for each experimental, group teacher while the 
*edocator was preparing for »and measuring]^ th? student's reading 
behavior. Immediately following an observation, the triiiner scored 
items 1 and 2 of the AIRS. Jhen, the trainers collected the 
experimental students' Goal and Objective Forms, Graphs, and 
Instructional Change Forms,, and gave t-hem to IRLD staV who used the 
documents to score itemS'^S through 13 of the AIRS. 

During June> students were tested in groups of 4 to 10 children 
on the Stahford' Diagnostic Rea^nq Test > SA and RC' subtests.' 
Additionally,* the three third grade heading passages were administered 
individuality in standard fashion (Mirkin et al,, 1981) during one 
session. All testing was conducted by thV TTs. In June, a subsample 
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of students was interviewed by the trainers the 'End-of-the-year 

Student I ntervievT Schedule. , ' • • • . 

f» * 

Data Agalysis ' 

Because teachers, rather thaii students, were assigned randomly to 

. . ■ ^ ^ . '\ > ' 

treatments, student reading scores for each teacher on each dependent 

measure were averaged, " arid "teacher" was , designated as the 

experimental unit ,of .analysis. Compalri'sons of the pretest oral 

reading error scores of the two groups revealed a statistically 

significant difference favoring the contrast group,' ^(37) = 2,03, £< 

,05 (mean difference = U95), The reading posttest scores, therefore, 

were analyzed by means of a two-way multivariate' * analysis of 

i Covariance (MAI^COVA) as Well as follow-up two-way univariate analyses 

of covariance (/^NCOVAs). The' experimental factor was' 

measurjement/evaluatioD treatment and the blocking factor wa^ teacher 

trainer; the j)retest error .variable was entered as the. covariate in 

each aoalysis. Prior to using the a,nalysis .of covart^nce, the 

assumption of homogeneous regression coefficients was tested and 

scattergrams were inspected; assumptibns of equal regression slopes 

and^j^ linearity of Y on X appeared tenable. 

Responses bn the End-of-the-year Student Interview were, analyzed 

employing chi-square tests. Measurement/evaluation treatment and ^he 

relevant item on the survey were the two d^men.sions of each 

contingency table. t'The chi-square statistic ^ was selected ^ over 

correlations because of the nominal, dichotomous nature of the 



measurement/evaluation factor. » 
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16 ^ . . ' * 

' ^ . Results » ^ - 

The analysis of ^^the results atidressed three questions. First, to 

what degree did the te£(chers implement the measurement and evaluation 

system? Second, did the^ students of the experimental and contrast 

teachers cjiffer in' their reading achievement? Finally, did the 

students of the experimental and contrast teacfiers differ with respect 

to their knowledge of their reading goals and progress? . . 

System Implementation . ^ • ^ 

There were three' Indices of the -extent to wfiteh, and the accyracy 

w1t,h, which, the, experimental 'teachers ^actually implemented the 

measurement;^ and e^valuatioh system: the AIRS,; the weekly number of 
' * ' ^ ^ . '* ' ' ■ . \- * ' 

data 'points on students '"graphs, and the number of changes teachers 

^ introduced into their-studtents ' programs. * ^ 

Table 1-displays qieans and standar^ devlatlqns^foJi.t.he jth"^^^ 
collection times on the 13 Items of the'AIRS, the Measurement gr^oup pf 
items, and the average across the Measurement gro^Up and the other Vive 

^ items (the; overall total), For the overall total, ratings of, the. 
teachers on the 5-point scale 'v/ere 3^32, 3.66, /and 3-85, respectively, 

/ for the three data collection times. 



Insert Table 1 about here 



. Teachers measured students* reading performance an average 2.14 
pflies per week (SO = .44) during the first six weeks of the treatment. 
They measured students' reading performance an average 1.87 tifnes'per 
week (SO = .4.0) during the second six weeks, and an average 1;37 times 

2i 



per week (SD =,.77) during the last six weeks. 

Over the 18-week .treatment, teachers introduced" a ^median two 
modifications in their students* programs,^ In Table 2, a frequency 
count of teachers introducing different numbers of changes is shown. 
The mean, standard deviation, and median number"^ changes introduced 
during each trimester of the stucjy also are displayed. Teachfers 
introduced an average .95, 1.01, and .22 changes into students' 
programs during each trimester^ respectively.^ 



Insert -Tajl^e 2 about here 



\sti|j( 



d'ent Achievement 



The. two-way multivariate analysis of covariance was conducted on 
^the four ;4iQSj;test TGja^t^ variables,: the mean number of words correct 
per minute on the passage reading test, the- mean number of errors per 
minut-fe -on the passage reading test, the rav< score on the SA, and the 
raw score on the RC. Wilk's lambda criterion was used to test for 
equality of group centroids.* The value calculated with the Wilk's 
.lambda .procedure w^s transformed into an JF. value through Rao's 
approximation. The test of lambda produced £ values that were 
statistically .significant for the measurement/evaluation factor, 
'£(4,29) >= 3.80, < .05, and for the. teacher trainer factor, £(8,58) = 
1'2.99, ji< *091.; Therefore, univariate two-way ANCOVAs were computed. 

These ANCOVAs revealed^ statistically significant differences 
between the measurement/evalu'ation conditions on three of four 
'dependent variables. . *Ta|)le 3 displays, .for each measurement 



evaluation group ''an^ for eaehydep'e^ident variables, the mean, the 

V 

standard devjatidn, the mean adjusted f or - the . independent and 
covariate, and the and £-vaiue^. 

Insert Table 3 about here 



• The univariate ANCOVAs included teacher trainer as a second, 

JiloclC'ing factor to increase the statistical 'power of ttie analysis 
« • 

rather than to explore an intei^estin^, related research question. 
Therefdre, although there were statistically significant , differences 
between teacher trainer groups, on* two of four variables, those results 
are not presented further or discussed. The absence of any 
statisticany s-ignif leant interaction between . the measurement 
evaluation and teacher 'trainer factbrs renders any .additional 
discussion of the teacher trainer conditions extraneous, to the 
purposes- of this paper. 

» 

students' Knowledge about their Learning . . • 

A final set of analyses was conducted to ascertaiif differences 
between experimental and contrast conditions with respect to student 
knowledge of thetr own goals and progress. Figure S is "a display of 
the perce'ntages of experimental and contrast group student responses 
when they were asked during the End-of-the-year*^ Student interview, 
"What kind of progress have you made in reading this year?" As one 
can see., thestrend of respqnses, for each group was similar. A chi- 
square analysis on the proportional responding' of experimental "and 
contrast groups to this question revealed no sfigniflcant difference. 



A visual analysis of Figure 5> however., reveals that the co/itwst 
group students* cuV^ve was skewed more positively than that of the 
experimental groufj "students, with a greater percentage of control 
.group students describnng their progress as poor, and with a smaller 
percentage of contrast group children labeling their 'progress as 
great. . • * 

Insert Figure 5 about here 



' The remaining questions on the End-of-the-year Student Interview 

asked (a) whether students knew their, reading goals. and, if so, could 

they state theni,.(b) whether they would meet their reading goals this 

year; ^ and (c) how they " knew whether they would meet their goals. 

There was a statistically' significant 'relation between measurement/ 

evaluation treatment and whether students said they knew their goals, ^ 

x2(l) = 4..17, £ < .05, with experiment^ls typically saying they knew 

their goals and with contrasts typically saying they did not know 

their goals. Further., there- was a statistically significant relation 

between treatment group and students actually sjatinig their goals, 

x2(l) r 3*91, £ .< .05, with expeVimentals frequently stating their^ 

'4 - ^ 

goals: and with contrasts infrequently stating their igoals. 

' . ' There was no statistically significant relation between 

treatment and whether students stated that they thought they would 

meet their goa-U. Yet, the acoiracy of these statements "was,, related 

to treatment group,. x2(i) = 6.61, £< ^01, with a greater percentage 

of experimental group statements correct. ^ Additionally, when asked 

01' 
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how tiiey knew whether they would meet their goals, experimemtal 
students tended to say that they relied on their graphed data whereas 
contrast students tended to say that they "just thought so/i x2(l) 
9.47, £ < .005- ' ' * . 

Discussion 

The purpose of* this study* wai? to determine the * educ-ational 
effects of technically adequate, repeated curriculum-based measurement 
and 'evaluation^ procedures ♦ Using these procedures, the experimental 
teachers were (a) to measure and graph students' proficiency in 
reading aloud from text passages at least twice weekly, and (b) to 
compare, at frequent intervals, the students' actual increases in 
proficiency with increments required to achieve >ttie level of 
proficiency, specif ieid in the lEP goal* Wh6n students' actual progress 
trends compared unfavorably with their required increases, teachers 
were to introduce a new dimension into the students' programs in an 
attempt to stimulate better student * progress. Comparison teachers 
employed typical special education measurement and* ' evaluation 
procedures. ... 

The/ findings indicated that experimental teacher? implemented 
their treatment, moderately well for most of the treatment period. 
Although teachers conformed to the^ prescribed measurement Schedule 
fairly well during the first 12 weeks of the study, their rate of 
measuring fell during the last six w6eks of the treatment. 
Analogously, the mean number of changes introduced by teachers into 
students' programs dropped sharply during the last trimester of the 
study. These findirtgs corroborated prior research (Tindal et aU, 



1981) , which* suggested that teachers tend to measure students less 
frequently and evaluate data less* Vesponsively than prescribed. 
Nevertjieless,, the extent of - teachers* iniplementation was adequate 
?luring mp.st of this study, and their accuracy in implementing 
experimental procedures remained fairly stable across the entire 
treatment period. * . 

The results indicate that implementation of the frequent, direct 
(Tieasurement and evaluation procedures affected pbsitively both stulient 
achievement and student awareness of their own achievenient. Students 
of experimental teachers performed better than students of comparison 
teachers on virtually all achievement measures, .rate and accuracy in 
reading aloud from text , materials, and the Structural Analysis and 
Re^ading CotnpVehension subtests of "the Stanford Diagnostic Reading 
test . It is worth noting that while the former measure (reading aloud 
from text) was used throughout the study, the latter two were not 
directly measured by the teachers- as part of the evaluation ,sys1:em. 
The findings suggest that when teachers repeatedly .employ the simple 
one-minute test of reading aloud from text passages ^to index student 
progress, they can interpret student gains as representing general 

"reaiing achievement; that is, the data' apparently validly reflect 
fluency, decoding, and comprehension. the only measure on which 
experimental and contrast group performance was undifferentiated was 

:^error scores on the third grade oral reading passages. This may be 
explained by the poor reliabili.ty of such error scores (Deno, Mirkin, 
Chiang, & Lowry, 1980; Fuchs & Deno, 1981; Fuch?, Deno, & Marston, 

X * 

1982) . Consequently, the results of this -studyv suggest that 



technically adequate^ repeated curriculumrbased measurement, when used 

by teachers to evaluate and modify programs, positively affects 

student achievemerft. 

In addition to achieving better, the students in this study were 

^more knowledgeable about their own learning When their teachers used 

systeTmatic foea^rement and evaluation procedures. As compared to 

pupils whose progress was measured .and evaluated via conventional 

special education practice, students ^who were measured and evaluated 

repeatedly, and systematically in their curricula (a) tnore frequently 

• • " >. * 

said they knew their goals, (b) more of,tei1 actually stated their 
goals*, (c) were^more accurate in their estimates of whether they would 
meet their goals, and (d) more typically reported that they relied on 
data to. formulate estimates of whether they would meet their goals. 
>The results revealed .that repeate'd, direct systematic measurement and 
evaluation enhances .not only students' reading achievement,, but .also 
their knowledge concerning their own learning. These outcomes are 
theoretica-lly and socially important. . On the one *hand, they support 
the' hypothesis of many edt^tional^ psychologists (Banjkjp^, 1982; Crow 
& Crow, 1963; Farnbam-0iggory,\*197?; Prentice, 1961)*, thai 'students; 
.knowledge of their learrving may improve academic ^Performance. On the 
other hand, increased participation by . students, in their own 
education, itself, is often regarded as an important educational goal* 

This study generally provides support for increased - use of. 

• * .- , - ■. - ^ ' 

systematic, on-going, measurement and evaluation of student progress' by 
teachers^ It provides evidence that 'individuaj sfpecial education 
programs can Jbe monitored continuously and improved.^as requifed fo 



increase the likelihood of student gains. The findings of this 
research contradict the conventional argument that teachers do, not 
need to use frequent measurement and evaluation because (a) they a>;e 
already sufficiently aware of "student ^chiavement; and .(b) such 
procedures are inefficient. Teachprs ' who used frequent measurement 
and systematic evaluation were more effective in enhancing student 
growth &nd student awareness of their educational programs. 
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Table T .• ' * 
Cleans ^nd Standard Deviations on thfe Accuracy of Injplementation 
Rat'ihg Scale (AIRS) Durirrg Three Data Collection Times^ 



Time 1 , time,2 " Time 3. 

Items • Sa. X SD X ^ SD 

•Admirvi strati on of Task ' ^4.78 .43 • 4.83 i .38 4.78 .64 

Sampling Instructional Level 3.11 1.32 3.28 ,1.41 3.28 1.41 

<$aking Baseline ' 3-.56 1.20' 3.50 1.15 3.50 1.15 

Graphing 4.28 .57 4.17 .51 ' 4.17 .51 

Drawing Aimline ^ ' ■ ■ v. ,4.44 1.2^ 4.44 1,.29 4.44 1..29 

Writing Goal 4.94 '.24 4.94 , .24 4.94 ^.24 

• Writing Objective 4.78 .94 4.78 . .94 4.78 .94 

" Describing Measurement System 5.b0 .00 5.00 .00 4.89 .47 

Measurement^ • 4.30 .39 4.36 .38 ' 4.28^ .33 

WritinJ instructional Plan 2.11 1.32 2.67 1.28 2.64 1.36 

^ ... 1 (N=ll) 

Creating Substantial Changes 3.31 .75 3.94 ^ .85 3.00 2.82 

• . (N=13) (N=16) . (N=2) 

Describing Clear Changes 2.77 1.17 2.88 1.20 5.00 .00 

(N=13) (N=16). (N=2) 

timing Instructional Changes 2.61 1.58 3.83 .92 3.35 1.69 

• . . • (N=17) 

Selecting'Stimulus Material ■ 4.61 .70 / 4.9T ^ .24 4.89 .64. 

Overall Total^ 3.32 .62' . 3.66 * .52 3.65 .68 

^N^l? teachers, except as indicated fn parentheses. ' ■ .,f 

^The measurement group combined eight items re.lated to establ ishing .tlje 

measurement system and conducting measurement. • 6 

^The overall total was based upon the addition of the measurement group 

and each of the remaining five items. ^ , 
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•Table 2 - . ^ , . « ' 

Number of Changes Introduced into ExperimentaT Studertts|/-^ 
^ Programs During Each Third of the Treatmertt Period \^ 



First 
Third 



Secohd 
Third 



Tf^ird 
Thi rd 



FreqMency of 0 Changes 


21 


15 


53 


{frequency of 1 Change • 


, -26 


35 


6 


Frequency of 2 Changes 


■"'n. 


10 


4 


Frequency of Z Changes • ' 


■2 


3 


0 •• , 


Mean Changes 


.95 


1.01 . 


.22 


Standard Deviation 


.83 


,77' ' 


' ,. .59' 


Median Changes ' 


— ^— ^ ' 


1 


0 
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Table 3 • . ' , 

Means, Standard Deviations, Adjusted Means, and ANCOVA Results on Four' nependent Variables , 



^ ^ — ^ ^ — ^ ^ 




Measurement/Evaluation Treatment 
Experimental ' ' • Contrast 


< 


F *^test 




• \ 


X 


SD , Adjusted X 


X 


SD 


Adjusted Y 




F(l,32) 




Words correct 


70.23 


45.75 


78.93. 


, 5r.29 


38.18 


43:81 


t 

4.221 


.048 


^per minute 


















Errors per"* 


5.63 


2.08 ' 


5.21 


5.64 


^ 2.75 


'5.99 ' 


.OOl' 


.982 


minute 


















SA 


39.79 


12.28 


40.74 


29.65 


15.34 


2^.84 ' 


7.194 


.011 . 


RC 


43.95 


10.52" 


.45.37 


33.02, 


15V39 


31 .81 


y 4.222 


.048 
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Figure.!. Example of a graph of studenlf' performance data. 
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GOAL- In 



19 weeks 



i school weeks until year's end 



, when provided with 



stories from gr^ade level 2 - SRA passages , Michael 

(Level ^f, series) 



will read aloud at the rate of 85 

(wpm correct) 



(student's nam,e) 
, with no more 



than 8 



errors t 



OBJECTIVE Each successive week, when presented with a random* selection ^ 

from Grade level 2 > SRA passages the student will rea<^ 

(same level ^ and series as above) : . 

aloud at an'average increase of 2.6 wpm and no increase in ^ 



errors. 



Figure.2, Mjchael's .Goal and- .Objective Form * 
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Number of Correct Words. (•) and Errors (o) Per Minute Read% Michael from Pages 
in SRA', Level 2 Across Time, Under Baseline (A) and Three Instructional Strategies 
(B, C, and D). , 
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Instructional Change Form 



4sk 



..^^ Instructfijina) J'roceduresr. 


Arrangement 


Time 


^ Materials,, ^ 


Motivational Strategics 


.Orgl Reading Practice 
' Comprehension exercises 


Group (1:5) 


' ' . 

45 minutes 
• 


Double Action Short Story, 
" • • Part 2" 

Story Writing S class " , 
discussiort ' . 


Genera tijng own stories , 

t ♦ ' (,* 

c 


* ^ Language Experience ^ , 
,;. ; ^ .Approach ^ ^ '\ 


Individual 
^-with para-^ 
proiessiojia i 


same 


Studgnt^s own 'stories 
File cards., " ; 
Story Folder 


same - * . 


Language- Experience 

l^^ing Comprehension 
r Activities A_ ^ 


' Individual" . 
.with para-r 
profess iphal 

Individual 
• with teacher- 


20 minutes, 
20 minutes 


. ^ - ^ -J — 

See above 

McCall-Crabbs,' Book •£ / 
SRA kit 


same ' ~ - - - ■ 

^ * * 

individual arrangement 
witFteacher. • 



t 



. / - - 



4i 



Figure 4. Michaers Instructional Change Form-:* 
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' S-tudent Descriptions 



Figure 5. Percentage of Student Desciriptions of Progress During 
the Current Academic Year. 



'ERIC 



42 



o o 



o 

Q.UJ 
C 



Q. 4^ C 

c c <- 

• o * 
ql cm • 

(/> c u 
C (0 0) 

1-^ x: QC 
o 



School :^ 
Date: 



Appendix A, 
Accuracy of Implementation Rating .Scale 

Student: 



Teacher: 



Observer (Items 1 and 2):_ 
.Rater (Items 3-13): 



^Number of ob^ervatlojis prior to rating: 

Time observation b.eglns: Time observation ends:^ 

Time allocated to reading in^tructipn per day: 

/Currlculunt used for measurement: Publisher 

Seri es Level ^_ 

Instructions . . ' 



-.- •. - 

Circle the number that, accurately reflects your rating for each , 
variable. Only one number may be circled per variable. 1 reflects a 
low level of Implementation and 5 means total Implementation of the 
Procedures to Develop and Monitor Progress on lEP Goals. See Opera tlon- 
"al Definitions. Items 1 and 2 require 'direct observation of the measure- 
ment administration. ttems"3, 4, 5. 6, and 7 require Inspectlon-of the-^^ 
student graph. Items 8, 9, and 10 require Inspection of the student's 
lEP form. The InstructionaVl^lan must be Inspected to rate item 11. 
The Change Record must be Inspected torate Items 12 and 13. 



1 . 
.2. 

^2. 
4. 

s: 

6. 
7. 

~8. 
9. 

IID." 

T2. 
13. 



Administering the Measurement Task 
Selecting the Stimulus Material 

Sampling for, Instructional Level 
Basel In.e- 
^Gfaph Set-up 

Aimllne ^--^ 
Timing of Instructional Changes 

Lbng-Range Goal 
Shor]t-Term. Ol^jectlve 
Measurement System 

Instructional Plan 

Substantial Changes 
One, Clear Change 



2 


3 


4 


5 


2 


3 


4 


5 


•2 


.3 


4 


5 


2 


3 


4 


5 


2 


3 


. 4~ 




2 


, 3 


,4. 


5 


2-. 


3 


4 , 


5 


2 


3 


4 


5 


2 


3 


4 


5 


2 


3 


, 4 


'5 


2 




4 


5 


2 


3 


4 


. 5 


2 


3 


4 


5 
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AIRS 

Operational Definitions. 
^ Accuracy of Implementation Rating Scale 
Administering the Measurement Task 

5 - The measurement task is administered correctly: teacher 
brings stopwatch and pencil to measurement area; gives 
correct d irec ^ns for the task; administers the measure- 
ment proceatTre for one minute; correctly marks the teacher 
copy; correctly <counts words correct and incorrect; cor- 
rectly counts words correct and incorrect; corrfectly 
plots the data point.. 

1 - the teacher: forgets necessary materials; does not give 
. directions; does not time the task ^accurately; fails to 
mark the teacher copy or incorrectly marks errors; miscounts 
, correct and incorrect words; and inaccurately plots the data 
point. 

Selecting the Stimulus Material 

5 - The teacher has followed these procedures: Uses passages 
selected from thV l^veJ .that represents the annual goal. 
Observers should record the book- f^^om -.which the passage 
was selected and later check this with the long-range goal 
level. *At this level find the pages ,in these stories that ^ 
do not have e;^cessiv5^dialogue< indentations, and/or unusual 
pronouns. Write these page numbers on equal size slips of 
•paper. ^ 

- Put the slips of paper into a drawbag and shake it. 

» " 

- Randomly picV a slii5 of paper. 

- The page number chosen is the page where the student^ 
begins reading. If tje page chosen is a passage that' 
was read earlier during the week', draw another page 
number. 

Other completely random procedures are also rated a 5. . If, 
however, not all passages have an equal chapce of being 
selected, a 4 rating would be indicated. ' , 

1 - The teacher fails to randomly pick the passage or the Sample is 
taken from a domain^Hhich is greater or smaller than the one 
indicated in the goaK 

Sampling for Instructional Level 

5 - The teacher has sampled from higher or lower reading 'levels 
to 'find the level in which the student reads 20-29 wpm 
' (grades 1 & 2) or 30-39 vypm (grades 3 and up). 



1 - The teacher is measuring at a level which is too high or . 
too low. 

Baseline • ^ 

5 - The student's performance has been measured at least 3 tlines t 
estailish a stable baseline. A stable baseline ftieans thaA all 
data points fall within a range of lO;* 

1 - the teacher ha? not found a level for which a stable baseline 
has been established or h^s failed to collect 3 data points 
during the baseline phase. 

Graph Set-Up 

5 - The gr^aph is accurately set up: The dates filled in on the 
horizontal axis; the vertical axis is correctly labeled words 

read per minute from material-, the units of measure- 

ment'are specified; the student's name and-subject area are 
certified; a key identifies the symbols for correct (.) anxJ 
incorrect (x); symbols are placed at the' intersection of date 
and score; the data points'are connected with straight lines; 
and absences are recorded on the graph as (abs. ). 

1 - The graph does not include many of the Items mentioned above. 
Aimi ine ' . ^ 

5 - The long-range goal is marked on the graph with an X at the 
intersection of ^he desired performance level and date of 
attainment and^a line. of desired progress connepts the 
point representing the student's median score of the last 
3 data points from baseline and the LRG. 

1 - The long-range goal is not marked on the graphand/of the 
median and LRG are not connected. • ' 

Timing of Instructional Changes 

5 - All the adjustments in the student's program are made at the- 
appropriate time give<» tlje rules for data utilization: 

(1) Compaje the actual slope based on 7 -to 10 data points 
to the. slope required to attain the Annual Goal. 

(2) if the actual sloipe is equal to, or steeper than, the 
Annual Goal slope, continue the program. 

(3) if the actual slope is flatter than, the Annual Goal 
slope, change the program. 

1 - None of the adjustments in the student; s program are made 
at th.e appropriate time. 
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8. Lo,ng-Ranqe Goal . . ' 

' 5 - The long-range goal is accurately written; goal specifies - • 

tfie number of weeks until next review; stimulus materials 
for the goal represents the level in which the student 
is performing at entry level criterion; goal specifies 
student behavior; goal specifies mastery criterion of 
50 wpm with fewer than 5 errors (grades 1 & 2) or 70 wpm 

with fewer than 7 errors {grades 3r5) when there are 36 # 
weeks until the annual review. If there are fewer than 36 
weeks, the criteria can be lowered proportionately. 

1 - The long-range goal contains none of the above criteria. 

9. Short-Term Objective * , # 

5 - The short-term objective is accurately written; stimulus ^ 
material and behavior ts specified; and the average increase 
\n performance is the desired performance fliinus the actual 
performance divided by the number of weeks until the annual 
review. 1 

1 - The short-term objective contains none of the above criteria* 

10; Measurement System 

5 - The teacher has indicated how the material is organized, the i 
frequency of measurement, and what is to be recorded on the 
graph. ' ^ 

1 - The measurement system is not specified. 

IK* rn§t;ructional Plan ^ ^ i 

5 - The' instructional plan includes clear and specifi*c descriptions 
of the instructional procedures, th^ time spent in each acti- 
. vity, the pertinent materials, the arrangements, and the 
motivational strategies 

' ' . i 

1 - The instructional plan is unclear and lacks specific descrip- 
tion^ of the instructional procedures, the time spent In each 
activity, the periinentVmaterials, the arrangements^ and the ' ' 
' motivational strategies/ 

12» Substantial Changes ► , ' ^ I 

♦ 

5 - The adjustments in the student's program are always substantial 
(have a good chance of being effective^ see Unit XIV). 

• T - The adjustments are never substantial. | , 
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13. Cigar Change 



5 - All the adjustments made introduce only one, clear program . 
change. • , . 

1 - All the adjustments made introduce more than one chan'ge 
and/or the change is unclear. ■ ^ • ., . 
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GOAL In 



Appendix C , 

• eOAL AND OBJECTIVE FORM .■ ' _ 

, .when presented with stories from 



{S school weeks until year's end) 



(Level vy series) 
rate of 



(student* s name) 

with no more than 



wil 1 read aloud at the 
errors. 



(wpm correct) ~ * , 

OBJECTIVE Each successive week, when presented with a random selection fro^n^ 

, student will read aloud at 

(same level # and Series as above) 

wpm and no increase in errors. 



an average increase of 



MEASUREMENT SYSTEM Frequency: at least 2X/week; stimulus format: oral 
reading passages; Test administration procedure: same as manual; Scoring 
procedure: .same as manual; charting conventions: same as manual'. 
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student. 
Name 



lEP Gdal 
Area 



Changed jyi the liistrisctioiml Phn 



Tiine Available . 
foHnstruction 
Location 



of Instruction 



- -~ 1 n-str4JGtiona^-r — . V - ^ 

Procedures ' 


Arrangement ' 


Hme ' 


Materials 


Motivational 
■ Strategies 


DATE: 






• 




DATE; * ' • . 

* < • 


> A 


♦ 






DATE: ' " ^ ' 

4 ' 






1 




DATE: 

f 

• • 

i \ ' ' 

» / 


» 






» 


DATE: 




f * 
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Appendix E 

. tND^OF- YEAR-STUDENT INTERVIEW 

Student's Name Date_ 

Data Collector's Name . 



1) What kind of progress Have you made- in^ reading this year? 

□ n . 

Poor Not-so- Pretty Rood ^ Gredt Don't 
Good * Good Know 

2) Do you know* what your goal is i*n reading for this year?' 
If yes, can you tell it to me. '(Write below) 



□ □ ^ 
Yes No 



3) 'will you meet your reading goal this year? ^ q 

' Yes No " ' 

4) How do you know if^yoO'are goi^ig- to meet your reading goal this year? 



Student's Accuracy * " - ^" . ' ^ - 

Please .indicate the accuracy of the student's responses by ans.wering 
the following questions: * ^ 

]) Did the student name correctly hi$/h^>eading goal? - j— Q ' ^^^^ 

"v/ * . .Yes No \ 
If no, how was it wrong? - ' ^ ^ ' ' ; 
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Z) Wd the. strident i-ndicate correctly whether he/she will. meet his/her 

goal in reading? - * ^ • 

□ P ♦ 

Yes^ No ' . . 



If no, what- was wrong? ^ 

a' 



' OP 



\ 




Teacher Trainer^ 

Teacher 

School 



' Appendix F ^ ' - 
Sample Ejcperimental Training Schedule 

Date . 



v,^ ' Below, p,lease: (a) describe the nature of the^ training yau. have provided 
this teacher during the study, (b) indicate <the administrative arrangement 
untler which the training was provided (individual , 1-3 ratio,, etc, ) , and 
(c) estimate the number of hours. yau" spent with, the teacher on each training 
area. , " ^ 



/ 


DESCRIPTION OF TRAINING 


ARRANGEMENT " 


vNUMBER OF HOURS 


• 

1 

t - • 


— . — ■■ ' t : — — 

data-bSsed monitoring procedures 


. individual ' . ^ 




i 

\ 

1 


meeting the* iniSlividual. heeds 
of the students* 


individual 


1 hr.". . . 


*> 

• 


teaching decoding and ' . 
comprehension (literal and . • 
inferential) skills' 


— — ■ ' - — ' ' 

individuals^^ 

• • 


t 4 

3 hrs. 


• 


classroom management for 
reading - 


-T-! \ 

individual 


1 h'r. 




■ — • ■ — — ' 

■5 




• 

*• 








• 

« 
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Sample Control Training Schedule 

Teacher Trainer > - Date 

< ' " <■ , * ^ " ^ 

Teacher^ , * ^ * . 

.School . *^ ' 

• • ' ' ^ • , 

BeTovf, please: (a) describe the nature of the training you have provided 
this teacher during the^study, (b) indicate the administrative arrangement 
jjnder which the trainifig was pVovided ,(individual , 1-3 ratio, 'etc, )^ and 
(c) estimate the number of hours you spent wfth the teacher on each training 
area. ^ - * ^ ^ " 



DESCRIPTION OF TRAINING 


ARRANGEMENT 


NUMB£R OF HOURS 


Jsing the language ' ^ 
experience approach 


indiVidual^, , 


— r^~' 

, 10 Hrs. * 

* 1 


Individualizing 

r 


i*ndivi»dual \ • . 


\ 

1 

5 ]hrs 


Jsing audio-visual equipment 
for teaching and motivating 


individual 


^2 hrs. 


Diagnostic assessinent tools 

1 


individual ^ 


•3 hrs; 


t 


* ■ > 




• 

• t 
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